Identification of CT-based non-invasive radiomic biomarkers for overall survival prediction in oral cavity squamous cell carcinoma

This study addresses the limited non-invasive tools for Oral Cavity Squamous Cell Carcinoma (OSCC) survival prediction by identifying Computed Tomography (CT)-based biomarkers to improve prognosis prediction. A retrospective analysis was conducted on data from 149 OSCC patients, including CT radiomics and clinical information. An ensemble approach involving correlation analysis, score screening, and the Sparse-L1 algorithm was used to select functional features, which were then used to build Cox Proportional Hazards models (CPH). Our CPH achieved a 0.70 concordance index in testing. The model identified two CT-based radiomics features, Gradient-Neighboring-Gray-Tone-Difference-Matrix-Strength (GNS) and normalized-Wavelet-LLL-Gray-Level-Dependence-Matrix-Large-Dependence-High-Gray-Level-Emphasis (HLE), as well as stage and alcohol usage, as survival biomarkers. The GNS group with values above 14 showed a hazard ratio of 0.12 and a 3-year survival rate of about 90%. Conversely, the GNS group with values less than or equal to 14 had a 49% survival rate. For normalized HLE, the high-end group (HLE > − 0.415) had a hazard ratio of 2.41, resulting in a 3-year survival rate of 70%, while the low-end group (HLE ≤ − 0.415) had a 36% survival rate. These findings contribute to our knowledge of how radiomics can be used to predict the outcome so that treatment plans can be tailored for patients people with OSCC to improve their survival.

represents alcohol usage status, with three categories: "Alcohol user" (category 1, reference level), "Alcohol nonuser" (category 2), and "Unknown" (category 3).The average p-value for the tenfold cross-validation of the nonuser categroy is 0.084, with a standard deviation of 0.057 This suggests that alcohol usage may have a significant influence on the model, although the effect might not be as robust as the continuous variables, given there are 18 missing values in ETOH.Table 2 shows the final CPH model parameter estimates and goodness of fit statistics.For instance, the hazard ratios in column H.ratio provide insight into the risk effect of each factor.For the clinical ETOH effects, a hazard ratio of 0.54 indicates that non-users have a hazard of 0.54 times that of users.In terms of odds, the probability of death occurring (P) can be calculated by P = H.ratio / (1 + H.ratio).For alcohol users, there is a 65% chance of death, while for non-users, there is a 35% chance of death.The 95% confidence interval (CI) for the effect of non-users lies between 0.29 and 1.01, indicating an acceptable variability in the hazard ratio of 0.54.We also investigated the association between stage, treated as a ordinal variable, and survival time.The hazard ratio for stages is 1.37, indicating a 37% increase in risk when moving from one stage to the next.

Texture analysis
By standardizing HLE to HLE s (zero mean with a standard deviation of 1), the Hazard ratio of 1.29 for HLE s indicates that the chance of death increases by 1.29 times for patients with one standard deviation higher HLE s compared to the previous HLE s .In other words, for each unit increase in HLE s , there is a 14% increase in the chance of death compared to the previous one.The lower bound of 95% confidence interval [1.05, 1.58] is also greater than 1, further suggesting that HLE s is a significant risk factor for overall survival in OSCC.On the contrary, GNS has both a hazard ratio of less than 1 and a 95% confidence interval.The hazard ratio of 0.94 suggests a 52% chance of death for a patient with one unit increase in GNS, compared to a 48% chance of death for a patient with no increase in GNS.Interestingly, even when we include the stage as a covariate in the multivariate analysis, we still identify these two significant radiomics features.This suggests that there are distinct survival differences associated with these features, as demonstrated by the Kaplan-Meier estimator.
The NGTDM contains information about the average grey level of a voxel within a pixel neighborhood, and it also stores information regarding spatial changes in intensity.Strength is one of the five key perceptual attributes of texture, which indicates the ability of elements to be easily distinguished.GNS serves as an indicator of the distinctiveness of the tumor textures.A high GNS value corresponds to a strong texture.In study 39 , a significant correlation was found between GNS and coarseness.GLDM quantifies the amount of local variation present in an image 40 .HLE measures the distribution of large dependence with higher gray level intensities, which has been associated with the heterogeneity of tumors 41,42 .

Stratification analysis
The Hazard Ratio in Fig. 1 illustrates the impact of two radiomics features HLE and GNS, on each stage, stratified by ETOH and cancer stage while keeping other covariates fixed at their sample means.The KM curves are stratified based on drinking status (drinker and non-drinker) across different cancer stages.Each curve represents the estimated hazard ratio for individuals in a specific group (e.g., drinkers with stage 1 OSCC, and nondrinkers with stage 2 OSCC).The vertical axis represents the hazard ratio, while the horizontal axes represent the radiomics features values.The left plot demonstrates that a higher Hazard Ratio is linked to advanced stage status, with a positive correlation between HLE and Hazard Ratio when other factors are held constant.Alcohol users (ETOH = 1) exhibit a consistently higher hazard ratio across all stages than non-users (ETOH = 2).In the www.nature.com/scientificreports/right plot, we observe a negative relationship between GNS and Hazard Ratio.The hazard ratio shows a steadier change for non-users at each.Several trends are notable in this comprehensive dataset reflecting the clinical characteristics of two stratified cohorts, respectively, designated as HLE l vs HLE h and GNS l vs GNS h .The low-end group (GNS l ) consists of patients with a GNS value less than or equal to 14, while the high-end group (GNS h ) comprises patients with GNS values greater than 14.Similarly, for the normalized HLE (HLE s ) feature, we formed two groups: the low-end group (HLE l ) with an HLE s value less than or equal to − 0.415 and the high-end group (HLE h ) with HLE s values greater than − 0.415.This stratification allows us to identify differences in outcomes and treatment response between the groups, as well as explore the potential predictive value of these radiomics features.Predominantly male participation is observed in both cohorts, with the low-end cohort significantly outweighing the high-end cohort.The mean age at diagnosis is almost identical in both cohorts.As for lifestyle habits, low-end cohorts have Table 2. Final Cox model fitting on the sample data.95% CI is calculated by exp(coef ± 1.96 × se).Both tests yielded significant p-values across all folds, with average p-values of 3e−06 and 6e−06 for the log-likelihood ratio and the score test, respectively, suggesting that our model is highly significant and provides a good fit to the data.The training concordance index (CI) remained stable and high across all iterations, with an average value of 0.727 (SD = 0.011).The testing CI has a mean value of 0.699 and a standard deviation of 0.103.In a simlar vien, the average training CI for absolute survival is 0.751 and testing CI is 0.706.www.nature.com/scientificreports/ a slightly higher percentage of smokers.Regarding alcohol consumption, a larger 73% of GNS h participants were alcohol users compared to 45% in GNS l .The alcohol consumption rate was distributed evenly in HLE groups.Figure 2 displays the Kaplan-Meier curves for each feature.Stratification by GNS reveals a significant difference both in overall survival and absolute survival between the two groups.The group with GNS greater than 14 shows a flat overall survival curve with only two events out of 24 risks and absolue survival curve with only 1 event out of 23, suggesting that this group generally has a good prognosis with a high overall survival probability over the follow-up period.In contrast, the median overall survival for the group with GNS less than 14 is approximately 37 months, with 64 events out of 125.The 3-year overall and absolute survival rate for the group with GNS greater than 14 are 90% and 96%, compared to 49% and 57% in the group with GNS less than 14.The median overall survival for the HLE l group is 154 months, compared to 10 months for the HLE h group.The 3-year overall and absoulte survival rate are 60% and 75% for the former group and 23% and 49% for the latter.

Radiomics-based nomogram
Based on our model, we developed a nomogram in Fig. 3 that visually represents the CPH model presented in Table 2.This nomogram allows the estimation of overall survival for OSCC patients after treatment.To use the nomogram, one simply needs to input the values of four variables (HLE, GNS, stage, and ETOH) and mark them on their respective axes.Connecting these marked values with vertical lines to the top scale (points scale) determines the points for each variable.Adding these points together and marking them on the total points axis provides the total points.By connecting the position of total points with the corresponding survival probability, one can estimate the overall outcomes based on the Linear Predictor.
The purpose of the nomogram in thIs study is to provide a practical and user-friendly tool for estimating overall survival in OSCC patients after treatment.By integrating multiple prognostic factors into a graphical representation, the nomogram allows healthcare professionals to easily assess individual patient outcomes and make informed decisions regarding treatment strategies.The benefit of using a nomogram lies in its ability to incorporate complex statistical models into a visually intuitive format, enabling personalized risk prediction.It offers improved prognostic accuracy, individualized treatment planning, and enhanced communication between healthcare providers and patients.The nomogram serves as a valuable addition to clinical practice by facilitating shared decision-making and promoting precision medicine approaches in the management of OSCC.

Discussion
The present study aimed to evaluate the prognostic value of radiomics features in oral cavity squamous cell carcinoma (OSCC) patients.Our findings demonstrate that radiomics analysis of pre-treatment CT scans can provide valuable insights into the factors influencing survival and serve as prognostic biomarkers in this patient population.
Key findings of our study include two significant hazard ratios, 1.29 and 0.94, between two radiomics features and overall survival.The concordance-index (CI) showed a stable and high average value of 0.7, indicating good predictive accuracy of the prognostic model.These results highlight the potential of medical imaging, particularly radiomics, as a non-invasive and quantitative method for treatment prognosis.Methodological considerations were also addressed in our study.We standardized the voxel spacing across patients by resampling CT images to ensure accurate feature calculation.Additionally, gray-level normalization was applied to enhance the comparability of features and improve their robustness against variations in different settings.These steps are crucial for reliable and reproducible radiomics analysis.The feature selection process in our study involved sequential approaches consisting of Correlation Analysis, Score screening, and the SparseL1 algorithm.This process effectively reduced the dimensionality of the feature space while retaining a significant amount of the prognostic information present in the original data.This approach helps to mitigate issues such as bias, overfitting, and multicollinearity that can arise in high-dimensional data analysis.Best subset selection modeling techniques were utilized to identify optimal Cox proportional hazards models, leading to the identification of biomarkers associated with survival in OSCC patients.
The Cox proportional hazards modeling revealed several significant radiomics features associated with survival in OSCC patients.The continuous variables, Gradient-NGTDM-Strength (GNS) and Wavelet-LLL-GLDM-LargeDependenceHighGrayLevelEmphasis (HLE), showed varying levels of significance in the model, indicating their potential as prognostic biomarkers.
The categorical variable, alcohol consumption (ETOH = 2), also demonstrated some degree of influence on the model.The performance of our Cox model was assessed using log-likelihood ratio and score tests, which consistently yielded small p-values across all folds in the tenfold cross-validation.The concordance index (CI), a measure of predictive accuracy, remained stable and high, indicating the effectiveness of our model.These results suggest that our Cox model provides robust predictive performance for survival in OSCC patients.
While our study highlights the potential of radiomics in OSCC prognostication, it is important to acknowledge its limitations.The relatively small sample size and the nature of the survival study may impact the stability of the validated radiomics features in our model.To support a low-biased and variance survival model with four effects, it is recommended to have at least 40 events in each training set, requiring a sample size containing 67 events if 60% is allocated for the training set.This criterion restricts the degrees of freedom our model can reach, potentially affecting the prognostic ability of the underlying radiomics.Additionally, the current analysis focused on extracting radiomics features from a single imaging modality, i.e.CT.Future studies are warranted to investigate radiomics features from multiple modalities, such as CT and MRI, which opens up the potential to improve the prediction accuracy further.Last, the significant association between certain radiomics features and overall survival suggests that imaging features may reflect some of the underlying molecular characteristics of the tumors.Future investigations are warranted to integrate genetic TP53 mutations 15 and P16 overexpression 43 and radiomics data to characterize squamous cell carcinoma of the head and neck and provide an alternative non-invasive, multi-modal approach to OSCC outcome predictions.
In conclusion, our study demonstrated the potential of radiomics as an effective tool to predict treatment response in OSCC patients.Incorporating radiomics analysis into clinical practice could improve decision support and enhance patient stratification, reducing both over-treatment and under-treatment to improve outcomes.The findings from the study pave the way for future investigations through a larger clinical trial to further evaluate the clinical efficacy of radiomics biomarkers for overall survival prediction for OSCC patients.

Endpoints of interest and study cohorts
This retrospective cohort study examines a group of oral cavity squamous cell carcinoma (OSCC) patients who underwent contrast-enhanced CT scans at the institution between 2006 and 2017.The sample size consisted of 149 patients.We collected six clinical attributes, including age at diagnosis, gender, tobacco use, alcohol consumption, stage, and race, summarized in Table 3. Table 1 presents a comprehensive summary of the clinical factors observed in this cohort.The mean age at the diagnosis was 62, ranging from 29 to 98 years' old.Patients were categorized into four stages (I, II, III, and IV) based on the pathological assays of tumor specimens.Smoking and alcohol status were self-reported and coded as 1 for yes and 2 for no.The missing values for smoking and alcohol status were hard-coded as 3 due to their substantial representation within the dataset.Six treatment modalities, including chemoradiotherapy (CRT), chemotherapy (CT), surgery (Sx), radiotherapy plus surgery (RT + Sx), CRT + Sx, and CT + Sx, were administered to patients as their initial treatment.The endpoint in this study was overall survival (OS), defined as the time from the date of diagnosis (determined by the diagnostic scan or biopsy) to the date of death or last follow-up day.As of November 04, 2019, a total of 66 patients died after treatment.The average survival time among all 149 patients was 40 months (ranging from 1 to 154 months).Among the patients who died, the average survival time was 19 months (ranging from 2 to 154 months), whereas among patients alive at the last follow-up the average survival time was 58 months (ranging from 1 to 137 months).We also compared overall survival to absolute survival defined as the proportion of patients who were alive after primary surgery.

Data preparation and overall workflow
This study aims to enhance the prognostic and predictive value of radiomics for OSCC patients by extracting1092 radiomics features from pre-treatment CT scans.The primary objective is to identify prognostic and predictive biomarkers within these CT scans that can facilitate the assessment of treatment effectiveness at the individual patient level.This will enable the selection of tailored treatment strategies and ultimately lead to improved patient outcomes.The workflow outlining our approach is illustrated in Fig. 4. In this workflow, the tumor volume serves as the region of interest (ROI) from which all radiomics features are computed (as shown in Fig. 5).The contouring of the ROI was performed manually by experienced Radiation Oncologists using the Varian Medical System Eclipse software environment.These features underwent a selection process to minimize redundancy and were combined with clinical data.A CPH model, optimized via the best subset approach and tenfold cross-validation, was then applied.The model's predictive performance was evaluated using the concordance index.All statistical analyses were performed using R programming language, with a significance level (alpha) set at 0.05 for all tests.All procedures included in this application have been approved previously by the institutional review boards (IRB) of the University of Maryland School of Medicine Institutional Review Board.

Pre-processing
It is worth noting that a previous study 44 highlights that radiomics features are sensitive to voxel size.Therefore, maintaining consistent voxel sizes across patients is crucial to obtain accurate and reliable radiomics feature calculations.In this study, CT resolution sizes varied from 0.3 × 0.3 × 0.5 to 1.3 × 1.3 × 5. We resampled all CT images to the resolution of 1 × 1 × 1 mm 3 using the basis spline algorithm (Bspline) to interpolate the HU values in the resampled voxels.Correspondingly, we used the nearest neighbor algorithm to resample the tumor contours to the same resolution.The subsequent procedure, gray level normalization, is critical in improving the comparability and robustness of radiomics features across different settings and variations among patients.As demonstrated in a previous study 44 , gray level normalization reduces the variance and enhances the robustness of radiomics features, particularly regarding varying discretization levels.Therefore, normalization was applied to scale the Hounsfield Unit (HU) values to a uniform range across patients.This was achieved by subtracting the mean and dividing the voxel values by the standard deviation.Conventionally, this process yields an ROI with intensities approximately in the range [− 3, 3] after removing outliers outside three standard deviations.These resulting values are then further scaled using a normalized scale parameter, such as a value of 100, resulting in an approximate range of [− 300, 300].Finally, the intensities within the ROI were discretized using a unified binwidth of 5, starting from the minimum normalized HU value of 0. In this study, we selected a bin width of 5 to ensure an adequate number of bins (between 1 and 400) for capturing more granular textural information 45 .This   ) + 1 .This discretization approach offers the advantage of noise suppression and improved robustness of radiomics features.

Feature extraction
Features extracted from medical images carry the phenotypic characteristics of a tumor, as shown in Fig. 6.A typical medical image features data set involves measurements of tens of thousands of voxel intensities for a single tumor sample; usually, the number of features ( p ) is ≫ sample size ( n ).In this study, feature extraction was performed in each set of images using the Python library PyRadiomics 46 .Imaging Biomarker Standardization Initiative (IBSI) 47 described features were extracted in six families, including shape-based, first-order statistics, gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM) 48 , grey level size zone matrix (GLSZM) 49,50 , gray level dependence matrix (GLDM) 51 and neighborhood grey tone difference matrix (NGTDM) 39 .Additionally, features are calculated from wavelet, Laplacian of Gaussian (Log), square, square root, logarithm, exponential, and gradient-filtered images, making the total number of features 1092.

Motivation and overview
In the field of time-to-event data analysis, the Lasso-Cox model 52 has gained popularity for its efficiency and simplicity in modeling high-dimensional data 53 .This model allows for simultaneous regression and variable selection, making it a preferred choice in handling high-dimensional data.However, this algorithm often falls short of providing accurate predictive results.The regularization parameter in the Lasso-Cox model emphasizes penalization on large coefficients, leading to potential bias and an overly simplistic model that may underfit the data.To mitigate this issue and improve prediction accuracy, an alternative model called the Elastic-Net was introduced 54 .The Elastic-Net model incorporates both ℓ 1 and ℓ 2 penalties in ordinary least squares estimation and has been extended to handle time-to-event data 55 .However, the increased complexity of the Elastic-Net model poses challenges in searching for optimal hyperparameters, requiring additional validation or resampling techniques.This increases the computational complexity and potentially results in solutions trapped in local optima.Another challenge in radiomics data is high multicollinearity, which occurs when there is a high correlation between two or more measurements in the data.For example, sphericity, minor axis length, max axis length and elongation are variables that exhibit strong multicollinearity in the sense that elongation is merely the inverse of spherical disproportion, and elongation is the ratio of the minor axis length to the max axis length.Multicollinearity can lead to the phenomenon where a variable is not deemed significant when correlated features are also present in the model.While regularization modeling techniques can partially mitigate multicollinearity, they may struggle when dealing with highly correlated variables.
Regression modeling often suffers from bias, overfitting, and numerically unstable estimation in cases where the number of predictors (p) is much larger than the number of samples (n).Therefore, feature selection plays a crucial role in this study.From a practical standpoint, it is desirable to build parsimonious prognostic models that are both effective and easy to use for healthcare professionals.This challenge also extends to Cox regression for time-to-event data.Studies [56][57][58] have emphasized the importance of developing parsimonious Cox models.We propose a feature selection approach to optimize the Cox Proportional Hazard model, consisting of several steps outlined in Table 4. Firstly, we pruned the highly correlated features.

Pearson correlation analysis
Given that radiomics features exhibit strong multicollinearity, relying solely on regularization modeling can introduce bias.There is now a substantial body of research on mitigating multicollinearity, such as Principal Component Analysis (PCA), Sparse PCA 59 , and Kernel PCA (KPCA) 60,61 .To mitigate multicollinearity, we first employed Pearson's correlation coefficient to detect linear dependencies among radiomics features.The Pearson correlation ranges from − 1 to 1, with a value of 0 indicating no linear correlation.In the medical field, a Pearson's score of 0.7 suggests a moderate agreement between two features based on previous studies 62,63 .Features with a Pearson's score exceeding 0.7 were pruned, resulting in 79 features for subsequent analysis.The result of pruning is illustrated in Fig. 7. Figure 7 uses a color scheme where white represents no correlation, blue represents a perfect negative correlation, and red represents a perfect positive correlation.The left diagonal map illustrates the correlation coefficients prior to the feature selection process, revealing the initial relationships between features.The right diagonal map presents the correlation coefficients after highly correlated features have been removed, demonstrating the outcome of the feature selection process.The left diagonal map initially revealed numerous red and blue shades, indicating strong positive and negative correlations, respectively, among the data.By comparison, the right diagonal map is lighter, indicating these highly correlated features were subsequently removed from the data.To detect correlation among categorical features, we utilized Cramer's V value.Figure 8 is a graphical representation of the Cramer's V between categorical variables in the data, where white (Cramer's V = 0) represents no correlation and red (Cramer's V = 1) represents a perfect correlation.Feature T and Stage were found to be correlated.We dropped the T variable and retained smoke and ETOH, considering that numerous studies have identified smoking and drinking as risk factors.These correlation measures provide insights into the relationships among radiomics features and aid in addressing multicollinearity to ensure more robust and accurate modeling in our study.After pruning, 1013 radiomics features and 1 categorical feature were effectively eliminated, resulting in a total of 85 features for subsequent analysis.

Univariate score test
The univariate Cox score is the most straightforward method for identifying features associated with variability in survival time in time-to-event data analysis 57 .Our focus is on reducing the number of radiomics features.The screening procedure consists of two steps: fitting 79 univariate Cox proportional hazards models for all radiomics features and using the score test statistic to assess the strength of association between each feature and the outcome.We prioritize p-values over setting a threshold for the statistic value.Features with a score test p-value less than or equal to 0.05 were considered significantly associated with the outcome and retained for subsequent analysis, resulting in 17 features for the next step.

SparseL1 selection
To fit a CPH model, it is critical to choose an appropriate degree of freedom that balances the complexity and accuracy of the model.According to 64,65 , a useful heuristic is to limit the number of predictors used in the fitting should be at most 15% of the events in the training sample.This criterion is corroborated by the simulations study in 66 that the prediction error is lower in CPH models that satisfy this condition.In our study, we observed 66 out of 149 events.Therefore, our target degree of freedom in the final model should be at most 4. Subsequently, we employed the Best Subset Selection (BSS) Modeling strategy in order to identify the best Cox Proportional Hazard (CPH) models based on their Concordance Index.BSS is widely recognized as a highly effective strategy for identifying the best parsimonious model, surpassing other strategies such as stepwise selection, forward selection, backward elimination, and Lasso.However, the computational cost associated with BSS limits its practical usage compared to other techniques.Considering fitting 2, 3, 4 degree-of-freedom CPH with an input data of 17 radiomics features plus 6 clinical features, BSS needs to estimate 41,262 coefficients at least.Exhaustively evaluating all possible subsets is computationally infeasible.Therefore, we need to reduce computational efforts by limiting the number of input variables before BSS.In this study, we employed a variation of Principal Component Analysis (PCA) known as the SparseL1 algorithm 38 to constrain the input data.The SparseL1 algorithm approximates the solution vector v by solving the NP-hard problem: where α n×1 represents the representation vector of n observations and v 1×p is a sparse vector in which each coordinate corresponds to a specific radiomics feature.By enforcing regularization, SparseL1 encourages many coordinates to be zero, effectively eliminating redundant features.SparseL1 is less sensitive to outliers compared to PCA, ensuring robust and consistent solutions.Additionally, the sparsity of the subspace can be adjusted using a single parameter λ, which serves as a controller for the number of inputs.Using this algorithm, we were able to reduce the previous 17 radiomics features to 7 features.

Cox modelling and best subset selection
After feature selection, a multivariate Cox proportional hazards model was utilized to model the prognosis for individual patients.The Cox proportional hazards model is a commonly used approach for analyzing time-toevent data and assessing the effects of predictors on survival time.The Cox proportional hazards modeling is concerned with estimating the coefficients in the linear model: where h(t|x i ) is the cumulative hazard function for subject i with m variables, under the assumption the hazard ratio comparing any two observations remains constant over time.The coefficients β were estimated by solving maximizing the partial likelihood function: where E is the set of indices of dead patients and E r is the set of the indices of alive patients at the time t r .We employ the Concordance index as the primary criterion for Best Subset Selection: where |β| 0 is the 0 -norm of β .Finally, a tenfold cross-validation procedure was used to assess how the selected model will generalize to a new data set.The data was split into 10 folds, ensuring an even distribution of status

Figure 2 .
Figure 2. The top four KM curves represent overall survival rates stratified by each factor, while the bottom four KM curves depict the absolute survival stratified by each factor.

Figure 3 .
Figure 3. Nomogram from the fitted Cox model.

Figure 4 .
Figure 4. Image feature extraction and outcome prediction workflow.

Figure 5 .
Figure 5. ROI (red) in the left oral cavity.

Figure 6 .
Figure 6.The typical workflow of radiomics feature extraction.

Figure 7 .
Figure 7.Comparison of Correlation Coefficient Heatmaps: On the left, the diagonal heatmap illustrates the pairwise correlations among features before pruning.On the right, the diagonal heatmap demonstrates the correlations after pruning.The color scale represents the strength of the correlation, with blue indicating negative correlation, red indicating positive correlation, and white representing no correlation.We created the correlation heatmap using the 'ggplot2' package in R version 4.3.1 (Wickham, 2016).

Table 1 .
Tenfold Cross-validation p-value mean and standard deviation of CPH models and variable.

Table 4 .
38ature selection procedure.Next, we selected a subset of features based on their relationship with overall survival.Finally, SparseL138recommended a final active subset, effectively eliminating redundant features to ensure computational feasibility for the Best Subset Selection strategy.